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INTRODUCTION: 


In  response  to  the  potential  use  of  viruses  as  biological  weapons,  we  have  established  the  Viral 
Biological-threat  Bioinformatics  Resource  (VBBR)  that  collects,  catalogs,  annotates,  and  analyzes 
genetic  information  related  to  potential  viral  threats.  This  work  expands  upon  available  knowledge 
of  virus  replication,  pathogenicity,  and  virus-host  interactions  on  the  basis  of  individual  protein 
domains,  individual  genes,  and  whole  genomes.  This  information  has  been  made  available  to  the 
scientific  community  to  support  current  research  efforts  to  develop  environmental  detectors, 
diagnostics,  antiviral  compounds,  new  vaccines,  and  animal  models. 

Specifically  we  are: 

1.  developing  a  relational  database  that  supports  the  data  storage,  annotation,  analysis,  and 
information  exchange  goals  of  this  proposal; 

2.  developing  a  world-wide  web  site  that  will  allow  all  information  and  analytical  results  compiled 
and  generated  by  this  work  to  be  publicized  to  the  scientific  community,  providing  rapid  access  to 
specific  information  as  required  by  an  individual  researcher; 

3.  collecting  existing  gene  and  genomic  sequences  for  viral  threat  agents  and  importing  them  into 
the  database  for  subsequent  annotation  and  analysis; 

4.  providing  computer-automated  and  human-directed  annotation  to  confirm  and  update  existing 
biological  information  in  the  sequence  records  for  all  sequences;  and 

5.  performing  a  variety  of  analyses  on  genome  and  gene  sequences  to  provide  additional 
information  on  their  structure,  function,  and  evolution. 

BODY: 

Substantial  progress  has  been  made  in  accomplishing  many  of  the  tasks  outlined  in  the  original 
statement  of  work.  Accomplishments  extend  to  database  construction,  data  population, 
development  and  implementation  of  analytical  and  visualization  tools,  and  publication  of  all 
available  information  on  a  web  site.  The  accomplishments  are  listed  below,  itemized  according  to 
the  original  statement  of  work  and  task  list. 

Task  1.  To  develop  a  relational  database  that  will  support  the  data  storage,  annotation, 
analysis,  and  information  exchange  goals  of  this  proposal.  (Months  1-6) 

•  The  database  has  been  created  based  on  our  previous  work  on  poxviruses  and  has  been 
updated  and  refined  to  better  support  a  wider  range  of  virus  threats.  An  overview  of  the 
database  schema  is  provided  in  figure  1 . 
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Figure  1.  VBBR  Database  Schema 
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Task  2.  To  develop  a  world-wide  web  site  that  will  allow  all  information  and  analytical 
results  compiled  and  generated  by  this  work  to  be  publicized  to  the  scientific 
community,  providing  rapid  access  to  specific  information  as  required  by  an 
individual  researcher.  This  web  site  will  also  provide  the  necessary  security  and 
access  controls  to  ensure  that  only  individuals  and  groups  as  designated  by  the 
granting  agency  (USAMRMC)  can  utilize  these  resources.  (Months  1-12) 

•  The  web  site  has  been  established  and  is  accessible  at  the  url:  http://vbbr.genome.uab.edu. 

•  Login  to  the  web  site  requires  registration.  Following  registration,  a  user  account  is 
established,  and  access  is  secured  via  a  usemame/password.  To  register,  a  user  needs  to 
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logon  to  the  following  web  page:  https ://btd. genome. uab .edu/register/register. asp  using  the 
following  credentials:  Username:  genuab;  Password:  $mgbf2003  and  fill  out  the  form.  The 
user  will  be  vetted  for  access  and  when  approved  will  then  be  notified  via  Email  when  the 
account  is  established. 

•  The  home  page  for  the  VBBR  web  site  is  shown  below  in  figure  2. 
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facilitate  the  development  of  novel  antiviral  therapies  and  vaccines  against  virus  infections 
as  well  as  approaches  for  environmental  detection  of  virions  and  the  rapid  diagnosis  of 
disease. 
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The  Center  provides  a  relational  database  that  supports  data  storage  of  virus  genomic 
sequences,  and  annotation  and  analysis  of  virus  genes;  web-based  data  mining  and 
sequence  analysis  tools;  software  for  analysis  of  complete  genomes;  and  a  discussion 
forum. 


The  Viral  Biothreat  Bioinformatics  Resource  Center  was  created  by  a  consortium  of 
laboratories.  An  Advisory  Committee  has  been  established  to  help  guide  the 
development  of  this  Center.  Funding  for  this  project  is  being  provided  by  the  US  Army 
Medical  Research  and  Materiel  Command.  Suggestions  for  improvement  of  the  Center 
are  welcomed  (see  Feedback). 


Last  Modified: 
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Copyright  ©  2002  VBBR  Consortium  .  All  rights  reserved.  Privacy  Statement 
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Figure  2.  VBBR  WebSite  —  “ 


Task  3.  To  collect  existing  gene  and  genomic  sequences  for  viral  threat  agents  and  import 
them  into  the  database  for  subsequent  annotation  and  analysis  (Months  1-24). 

a.  We  will  obtain  existing  gene  and  genomic  sequences  for  all  viruses  that 
potentially  could  be  used  as  agents  of  biological  warfare  or  terror.  The 
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information  collected  will  include  the  corresponding  descriptive  annotation 
provided  with  the  sequences. 

•  An  automated  parser  has  been  developed  (XSeq  Exchange)  that  can  parse  XML- 
formatted  GenBank  sequence  records  and  their  annotation  and  load  that  information 
into  the  VBBR  database.  This  application  will  greatly  enhance  our  ability  to  add 
new  viral  genomes  to  the  VBBR  database.  The  database  schema  for  the  parser  is 
shown  below  in  figure  3. 
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•  The  genome  and  gene  database  is  currently  populated  with  most  of  the  viruses 
currently  on  the  NIAID  A-C  priority  pathogen  list.  These  are  shown  on  the 
genomes  page  of  the  VBBR  web  site  as  displayed  in  figure  4. 
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Figure  4.  VBBR  Genomes  Page. 
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b.  We  will  also  collect  and  provide  the  references  for  publications  in  the  scientific 
literature  that  describe  studies  on  the  structure  and  function  of  orthopoxvirus 
genes  and  genomes.  We  will  link  this  information  on  our  web  site  to  the 
appropriate  gene  records  in  our  database. 

•  Not  yet  done.  Tasked  for  year  2. 

c.  We  will  initially  obtain  viral  sequence  from  public  databases  such  as  GenBank. 
We  will  also  obtain  unpublished  sequence  information  through  inquires  and 
interactions  with  other  scientists  that  have  been  initiated  by  our  collaborators. 

•  Not  yet  done.  Tasked  for  year  2. 

Task  4.  To  provide  computer-automated  and  human-directed  annotation  to  confirm  and 

update  existing  biological  information  in  the  sequence  records  for  existing  as  well  as 
newly  obtained  sequences.  (Months  6-24) 

•  Computer-automated  annotation  has  been  completed  for  the  viruses  listed  in  figure 
4.  Human  directed  annotation  is  tasked  for  year  2.  An  example  annotation  record  is 
shown  in  figure  5. 

Task  5.  To  perform  a  variety  of  analyses  on  genome  and  gene  sequences  to  provide 

additional  information  on  their  structure,  function,  and  evolution.  (Months  6-24) 

•  Initial  computer-aided  analysis  has  been  completed.  Additional  analyses  tasked  for 
year  2. 


Task  6.  To  provide  through  the  web  site,  a  set  of  analytical  and  visualization  tools  that  will 
allow  users  of  this  resource  to  mine  the  data  for  useful  information,  to  perform 
comparative  analyses  between  these  and  other  viruses,  and  to  better  visualize  and 
assess  the  significance  of  their  results.  (Months  6-24) 

•  Currently  available  analytical  and  visualization  tools  include: 

a.  Genome  map  visualization  (Figure  6.) 

b.  Genome  protein  gene  ortholog  comparisons 

c.  Gene  synteny  visualization  (Figure  7.) 

d.  BLAST  similarity  searches 

e.  BLAST  search  parsing  and  database  storage 

f.  Java-based  Visualization  of  BLAST  search  results  (Figure  8.) 

•  A  new  database  and  web  server  has  been  installed  and  is  now  supporting  all  VBBR 
resources.  This  new  server  provides  added  security  and  has  greatly  increased 
performance  of  the  VBBR  web  site. 

•  Development  of  additional  analytical  and  visualization  tools  are  tasked  for  year  2. 
These  will  include  tools  for  creation  and  visualization  of  multiple  sequence 
comparisons  and  tools  for  the  inference  and  visualization  of  evolutionary  trees. 
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Figure  6.  VBBR  Genome  Map. 
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Figure  7.  VBBR  Gene  Synteny  Map. 


11 


-  '/:s  >'/*??■  ' 

Exit  About 


Sort  By - 

Bit  Score 
Query  From 


Hit  Selection - 


e-Value  Filter - 


PROGRAM:  blastr 

DATABASE :  nr 

QUERY  SEQUENCE :  sars 


m. 


Horizontal  Zoom  - 


Number  of  Nucleotides 


gi  1260075461 ref  KL” CTT.7 

gi | 26008095 | ref 

gi| 22219303 |pdb 

gi[ 25121564 |ref 

gi | 26008087 | ref 

gi|25121551|ref 

gi | 25121568 |ref 

gi  1 25121569  l«f 

gi  1 6625761) gb | A 

gi  1 26008091 (ref 

gi  1 25121571 1 ref 

gi | 26008094 |ref 

gi 1 5565844 1  gb  |  A 

gi|670E916|gb|A 

gi  1 6179905 1 gb | A 

gi  1 11640712) gb | 

BLAST  Viewer  Usage: 

CLICK  or.  EIT  IIAIE  to  view  sequence  record. 
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Figure  8.  VBBR  BLAST  Viewer.  The  figure  shows  the  results  of  a  BLASTX  search  of  the  SARS 
genome  against  the  Genbank  non-redundant  protein  database. 
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KEY  RESEARCH  ACCOMPLISHMENTS: 


•  Establishment  of  the  VBBR  Database 

•  Establishment  of  the  VBBR  Web  Site 

•  Development  of  the  XSeq  Exchange  GenBank  sequence  parser 

•  Population  of  the  VBBR  database  with  virus  genomic  information 

•  Computational  annotation  of  virus  genes 

•  Implementation  of  analytical  and  visualization  tools  for  BLAST  searches 

•  Implementation  of  analytical  and  visualization  tools  for  genome  ortholog  comparisons 

REPORTABLE  OUTCOMES: 

Informatics: 

•  VBBR  Database 

•  VBBR  Web  site  (http://vbbr.genome.uab.edu) 

•  VBBR  analytical  and  visualization  tools 

Presentations: 

•  US AMRIID/WRAIR/L ANL/U AB  Workshop,  August  11, 2003  WRAIR 

•  USAMRMC  Bioinformatics  Workshop,  November  4, 2003  (Poster) 

Funding  proposals  submitted  extending  this  work: 

•  NIH/NIAID  Proposal  in  response  to  the  RFP:  NIH-NIAID-DMID-04-34, 
'Bioinformatics  Resource  Centers  for  Biodefense  and  Emerging/Re-emerging 
Infectious  Diseases" 

CONCLUSIONS: 

Development  of  the  Viral  Biological-threat  Bioinformatics  Resource  (VBBR)  has  substantially 
followed  the  original  task  list  and  essentially  all  tasks  proposed  for  year  1  have  been  accomplished. 
The  result  is  a  Bioinformatics  Resource  Center  that  can  now  begin  to  support  the  needs  of  basic 
and  applied  biodefense  research  directed  at  gaining  a  better  understanding  of  the  pathogenesis, 
evolution,  and  overall  biology  of  viral  pathogens  as  well  as  support  the  development  of  detectors, 
diagnostics,  antivirals,  and  vaccines.  Future  work  will  follow  the  original  task  list  and  is  aimed  at 
providing  human-curated  gene  records  along  with  additional  analytical  and  visualization  tools  that 
can  better  support  understanding  the  role  of  individual  genes  in  virus  pathogenicity. 
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